PARAPRODUCTS IN ONE AND SEVERAL PARAMETERS 



MICHAEL LACEY AND JASON METCALFE 



Abstract. For multiparameter bilinear paraproduct operators B we prove the estimate 

$B : L^p \times L^q \mapsto L^r, \qquad 1 < p, q \le \infty.$

Here, $1/p + 1/q = 1/r$ and special attention is paid to the case of $0 < r < 1$. (Note that the
families of multiparameter paraproducts are much richer than in the one parameter case.) 
These estimates are the essential step in the version of the multiparameter Coifman-Meyer 
theorem proved by C. Muscalu, J. Pipher, T. Tao, and C. Thiele [10, 11]. We offer a different 
proof of these inequalities. 



1. Introduction 

Our subject concerns the Coifman-Meyer theorem in a multiparameter setting. Namely, 
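for a bounded function $\tau : \mathbb{R}^d \to \mathbb{C}$, we set

$T(f_1, \ldots, f_d)(x) = \int_{\mathbb{R}^d} \tau(\xi)\, e^{i x (\xi_1 + \cdots + \xi_d)} \prod_{j=1}^{d} \hat f_j(\xi_j) \, d\xi_1 \cdots d\xi_d$

in which $f_j$ are Schwartz functions on $\mathbb{R}$ and $\hat g$ denotes the Fourier transform, namely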

$\hat g(\xi) = \int g(x)\, e^{-i x \xi} \, dx.$

One is interested in conditions under which $T$ extends to a bounded multilinear operator
on a product of $L^p$ spaces. The motivation for this paper is the following theorem.

1.1. Theorem. Suppose that $\tau$ obeys the estimates

(1.2)   $|\partial^{\alpha} \tau(\xi)| \lesssim |\xi|^{-\alpha} = \prod_{j=1}^{d} |\xi_j|^{-\alpha_j}, \qquad |\alpha| < N,$

where $\xi = (\xi_1, \ldots, \xi_d)$ and likewise for the multiindices $\alpha$. There is a finite choice of $N$ so
that for all $1 < p_j \le \infty$, with not all $p_j$ being equal to infinity, the operator $T$ extends to a
bounded linear operator

$T : L^{p_1} \times \cdots \times L^{p_d} \to L^r$

where $1/r = \sum_{j=1}^{d} 1/p_j$.

In the statement of the Theorem, $A \lesssim B$ means that $A \le KB$ for some unspecified
constant $K$.

This theorem has been the subject of wide ranging investigations since the initial results of
Coifman and Meyer [3,4]. The methods and techniques of the proof, built around the subject
of paraproducts, are the main focus of this article.

The singularities permitted in $\tau$ in (1.2) invoke elements of the product theory of maximal
functions, singular integrals, and related subjects. Some cases of the theorem above
were found by Journé [8], following the identification of product BMO by S.-Y. Chang and
R. Fefferman [1,2,5].

The possibility that the image space $L^r$ can have index less than one is the primary new
contribution of C. Muscalu, J. Pipher, T. Tao, and C. Thiele [10, 11]. The purpose of this
article is to give a somewhat different proof, one that discusses end point issues, and is a
little more leisurely than the cited articles.

The method of proving these inequalities is by way of paraproducts, and we take the
latter as the primary focus of this article. See the next section for a definition of the
most familiar paraproducts. Many proofs of paraproduct results depend upon the
Calderón-Zygmund decomposition, which has only weak analogs in the product theory.

A very nice feature of the work of Muscalu et al. is that they find that the proof of the 
theorem can be understood in terms that avoid the intricacies of the product BMO theory of 
Chang and Fefferman. We find that some aspects of that theory enter into different endpoint
estimates, such as at $p = 1$, where $L^1$ should be replaced by the Hardy space $H^1$, and $p = \infty$,
where the BMO space enters in.

The main result is Theorem 4.32 below, a discrete form of the Theorem above. This 
theorem is contained in [10, 11], and our proof borrows elements of theirs. We offer the proof 
as it differs in some details. In addition, the rich family of multiparameter paraproducts is 
not necessarily well understood. We hope that this paper advertises [10,11] in particular, 
and, more generally, the subject of multiparameter paraproducts.¹

¹ A substantial part of the difficulties in [7,9] is attributable to the variety of paraproducts in the
multiparameter setting.

In the next section, we discuss the one parameter paraproducts. The proofs in this case
feature initial details that can be used in the multiparameter case. We present proofs of
these results in the special case of paraproducts formed from Haar functions. The subject
of multiparameter paraproducts is taken up in Section 4. Our presentation and proofs in
Section 3 have been influenced by the CBMS lectures of C. Thiele [12].

2. One Parameter Paraproducts 

For an interval $I$, we say that $\varphi$ is adapted to $I$ iff $\|\varphi\|_2 = 1$ and

(2.3)   $|D^n \varphi(x)| \lesssim |I|^{-n-\frac12} \Bigl(1 + \frac{|x - c(I)|}{|I|}\Bigr)^{-N}, \qquad n = 0, 1.$

Here, $c(I)$ denotes the center of $I$, and $N$ is a large integer, whose exact value need not
concern us, except to say that its value can depend upon the $L^p$ inequalities that we are
considering.² $D$ denotes the derivative operator. We shall consistently work with functions
which have $L^2$ norm at most one. We will also insist that some of these functions have integral
zero. (Terminology for this will be introduced below.)

Intervals will most typically be dyadic, and we use the notation $\mathcal{D}$ for these intervals. To
be specific,

$\mathcal{D} = \{[j 2^k, (j+1) 2^k) : j, k \in \mathbb{Z}\}.$

With the control on the function and its derivative in the definition of adapted, elements of
Littlewood-Paley theory will apply. Namely, we will have the inequalities (2.16)-(2.18) for
the square function constructed from the functions $\{\varphi_I : I \in \mathcal{D}\}$.
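The dyadic grid can be made concrete with a short Python sketch. The function names below are illustrative and do not come from the paper; the sketch encodes $[j2^k, (j+1)2^k)$ by the pair $(j, k)$ and checks, on a finite sample, the nesting property that two dyadic intervals are either disjoint or one contains the other.

```python
from fractions import Fraction

def dyadic_interval(j, k):
    """Endpoints (left, right) of the dyadic interval [j*2^k, (j+1)*2^k)."""
    scale = Fraction(2) ** k
    return (j * scale, (j + 1) * scale)

def parent(j, k):
    """Index (j', k+1) of the dyadic interval of twice the length containing (j, k)."""
    return (j // 2, k + 1)

def nested_or_disjoint(a, b):
    """True if the half-open intervals a, b are disjoint or one contains the other."""
    (a0, a1), (b0, b1) = a, b
    disjoint = a1 <= b0 or b1 <= a0
    a_in_b = b0 <= a0 and a1 <= b1
    b_in_a = a0 <= b0 and b1 <= a1
    return disjoint or a_in_b or b_in_a

# All dyadic intervals of length 2^k, -2 <= k <= 2, contained in [-4, 4).
intervals = []
for k in range(-2, 3):
    n = int(4 / Fraction(2) ** k)  # number of such intervals inside [0, 4)
    intervals += [dyadic_interval(j, k) for j in range(-n, n)]

all_nested = all(nested_or_disjoint(a, b) for a in intervals for b in intervals)
print(all_nested)
```

This nesting property is what makes the stopping-time arguments of Section 3 work: a union of dyadic intervals always decomposes into its maximal members, which are pairwise disjoint.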

Operators are built up from rank one operators $f \mapsto \langle f, \varphi \rangle \varphi'$. A paraproduct is, in its
simplest manifestation, of the form

$B(f_1, f_2) = \sum_{I \in \mathcal{D}} |I|^{-1/2} \varphi_{3,I} \prod_{j=1}^{2} \langle f_j, \varphi_{j,I} \rangle.$

Here, the functions $\varphi_{j,I}$, for $j = 1, 2, 3$, are adapted to $I$. Two of these three functions are
assumed to be of integral zero. We should emphasize that each individual summand is of the
form

$(f_1, f_2) \mapsto |I|^{-1/2} \varphi_{3,I} \prod_{j=1}^{2} \langle f_j, \varphi_{j,I} \rangle.$

This is certainly a bounded operator from, say, $L^2 \times L^2 \to L^1$, and our desired conclusion
is that the same is true for the sums above.
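The boundedness of a single summand can be checked in two lines. The following is a sketch, assuming $N \ge 2$ in (2.3) and $\|\varphi_{j,I}\|_2 \le 1$; it uses only the Cauchy-Schwarz inequality and does not track constants.

```latex
\begin{align*}
\Bigl\| |I|^{-1/2}\varphi_{3,I}\prod_{j=1}^{2}\langle f_j,\varphi_{j,I}\rangle \Bigr\|_{1}
  &\le |I|^{-1/2}\,\|\varphi_{3,I}\|_{1}\prod_{j=1}^{2}\|f_j\|_{2}\,\|\varphi_{j,I}\|_{2} \\
  &\lesssim |I|^{-1/2}\cdot|I|^{1/2}\,\|f_1\|_{2}\,\|f_2\|_{2}
   = \|f_1\|_{2}\,\|f_2\|_{2},
\end{align*}
% since (2.3) with n = 0 gives
\[
\|\varphi_{3,I}\|_{1}\lesssim |I|^{-1/2}\int_{\mathbb{R}}\Bigl(1+\frac{|x-c(I)|}{|I|}\Bigr)^{-N}dx
 \lesssim |I|^{1/2}.
\]
```

The bound is uniform in $I$, so the whole difficulty lies in summing over $I \in \mathcal{D}$.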



² It will be clear in the sequel that $N = \min(3p_1 + 4, 3p_2 + 4)$ is sufficient for the bilinear case, for example.
The main size requirement can be found in (4.67).




We will also consider higher linearities

(2.4)   $B(f_1, f_2, \ldots, f_n) := \sum_{I \in \mathcal{D}} |I|^{-\frac{n-1}{2}} \varphi_{n+1,I} \prod_{j=1}^{n} \langle f_j, \varphi_{j,I} \rangle$

where the functions $\varphi_{j,I}$ are assumed to be adapted to $I$ and two are of integral zero. In the
course of the proofs, it is convenient to consider the $(n+1)$-sublinear forms

(2.5)   $\Lambda(f_1, f_2, \ldots, f_{n+1}) = \sum_{I \in \mathcal{D}} |I|^{-\frac{n-1}{2}} \prod_{j=1}^{n+1} |\langle f_j, \varphi_{j,I} \rangle|.$

Notice that this just assigns a number to the $(n+1)$-tuple of functions and that it dominates
$\langle B(f_1, f_2, \ldots, f_n), f_{n+1} \rangle$. It is also of interest to consider the related sublinear operator

(2.6)   $L(f_1, f_2, \ldots, f_n, f_{n+1}) := \sum_{I \in \mathcal{D}} \Bigl[ \prod_{j=1}^{n+1} \frac{|\langle f_j, \varphi_{j,I} \rangle|}{\sqrt{|I|}} \Bigr] 1_I.$

In particular, if $L$ maps a product of Banach spaces into $L^1$, then we conclude that $\Lambda$ is
bounded on a related product of spaces, see (2.20).

2.7. Theorem. For $n \ge 3$ and $1 < p_j < \infty$, define $\frac{1}{r} = \sum_{j=1}^{n} \frac{1}{p_j}$. Then,

(2.8)   $B : \bigotimes_{j=1}^{n} L^{p_j} \to L^r.$

In addition we have the endpoint estimates:

(2.9)   $B : \bigotimes_{j=1}^{n} L^1 \to L^{1/n,\infty}$

(2.10)   $B : \bigotimes_{j=1}^{s} L^1 \times \bigotimes_{j=1}^{n-s} \mathrm{BMO} \to L^{1/s,\infty}.$

In this last display, we require that the functions $\varphi_{j,I}$ have integral zero for any choice of $j$
for which $f_j$ is only assumed to be BMO.

The estimates above follow immediately from the corresponding estimates for the sublinear
operator, in the case that the index of the range is between 1 and $\infty$. Namely for $1 < r < \infty$,
this can be derived from

2.11. Theorem. For $n \ge 3$ and $1 < p_j < \infty$, define $\frac{1}{s} = \sum_{j=1}^{n} \frac{1}{p_j}$. In the case that
$0 < s < \infty$, then

(2.12)   $\|L(f_1, f_2, \ldots, f_n)\|_s \lesssim \prod_{j=1}^{n} \|f_j\|_{p_j}.$

In the case that any $p_j = 1$, then $L^1$ can be replaced by $H^1$, and the estimate above is true.
If we do not replace $L^1$ by $H^1$, then only the weak type inequality is true. In particular, we
have the estimate

(2.13)   $L : \bigotimes_{j=1}^{n} L^1 \to L^{1/n,\infty}.$

In the case that any of the $p_j$ equal $\infty$ and the functions $\varphi_{j,I}$ have integral zero for all $I$, then
the space $L^\infty$ above can be replaced by BMO.

The essential case is that of $n = 3$ above, and to avoid unnecessary notations, that is the
case discussed in the proof. Thus, we have three functions $\varphi_{j,I}$. Two of these are assumed
to be of integral zero. Due to the symmetry of the estimates we are to prove, we can assume
that these two functions occur for $j = 2, 3$.

This in particular means that we have the estimate

(2.14)   $\sup_{I \in \mathcal{D}} 1_I(x) \frac{|\langle f_1, \varphi_{1,I} \rangle|}{\sqrt{|I|}} \lesssim M f_1(x)$

where $M$ denotes the maximal function. There is another bound that applies to the second
and third functions. Namely, we set

(2.15)   $S_j f := \Bigl[ \sum_{I \in \mathcal{D}} \frac{|\langle f, \varphi_{j,I} \rangle|^2}{|I|} 1_I \Bigr]^{1/2}, \qquad j = 2, 3.$

It is a consequence of the integral zero assumption placed on the functions $\varphi_{2,I}$ and $\varphi_{3,I}$ that
the usual Littlewood-Paley theory applies to these square functions. Therefore, they map all
$L^p$ into themselves, for $1 < p < \infty$, and we have the usual endpoint estimates. To be explicit,
these estimates are

(2.16)   $\|S_j f\|_p \lesssim \|f\|_p, \qquad 1 < p < \infty,$

(2.17)   $\|S_j f\|_1 \lesssim \|f\|_{H^1},$

(2.18)   $\|S_j f\|_{\mathrm{BMO}} \lesssim \|f\|_\infty.$

Note that as we are using the maximal function and square functions, we have access to
the following upper bound for numerical sequences

(2.19)   $\sum_{n} \prod_{j=1}^{3} a_{j,n} \le \|a_{1,n}\|_{\ell^\infty} \|a_{2,n}\|_{\ell^2} \|a_{3,n}\|_{\ell^2}.$
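The sequence bound (2.19) is a supremum bound followed by the Cauchy-Schwarz inequality. As a quick sanity check, the following Python sketch (helper names are illustrative) compares the two sides on random real sequences.

```python
import math
import random

def lhs(a1, a2, a3):
    """Sum over n of the product a1[n] * a2[n] * a3[n]."""
    return sum(x * y * z for x, y, z in zip(a1, a2, a3))

def rhs(a1, a2, a3):
    """sup-norm of a1 times the l^2 norms of a2 and a3."""
    sup1 = max(abs(x) for x in a1)
    l2 = lambda a: math.sqrt(sum(x * x for x in a))
    return sup1 * l2(a2) * l2(a3)

random.seed(0)
trials = []
for _ in range(200):
    n = random.randint(1, 20)
    a = [[random.uniform(-1, 1) for _ in range(n)] for _ in range(3)]
    # small tolerance for floating-point rounding
    trials.append(lhs(*a) <= rhs(*a) + 1e-12)

print(all(trials))
```

Equality is attained, for instance, when all three sequences are constant and equal to one.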

2.1. Generalities on the Proof. If $1 < p_j < \infty$ and $\frac{1}{p_1} + \frac{1}{p_2} + \frac{1}{p_3} = 1$, we can estimate,
based on (2.19) and Hölder's inequality,

$\int B(f_1, f_2) f_3 \, dx \le \int L(f_1, f_2, f_3) \, dx$

(2.20)   $\lesssim \int (M f_1) \cdot (S_2 f_2) \cdot (S_3 f_3) \, dx$

$\le \|M f_1\|_{p_1} \|S_2 f_2\|_{p_2} \|S_3 f_3\|_{p_3}$

$\lesssim \|f_1\|_{p_1} \|f_2\|_{p_2} \|f_3\|_{p_3}.$

This argument also applies when $p_1 = \infty$. When, however, any of the $p_j = 1$, one should
replace the $L^1$ norm on $f_j$ with the $H^1$ norm.

The argument must be modified when, e.g., $p_2 = \infty$. For then the square function $S_2 f_2$
is no longer bounded. And indeed, the sharp estimate on the square function replaces $L^\infty$
with BMO.

Alternate methods are required when duality cannot be applied. Here, we shall obtain
inequalities of weak type. For example,

(2.21)   $\lambda |\{B(f_1, f_2) > \lambda\}|^{1/r} \lesssim \|f_1\|_{p_1} \|f_2\|_{p_2}, \qquad \frac{1}{p_1} + \frac{1}{p_2} = \frac{1}{r}.$

Interpolation will then supply the strong type inequalities, except for the endpoint estimates.

As the class of operators $B$ we consider is invariant under dilations by powers of 2, this
inequality follows from

(2.22)   $|\{B(f_1, f_2) > K\}| \lesssim 1$

where $K$ is an absolute constant, and the inequality holds for all choices of smooth compactly
supported functions $f_j$ with $L^{p_j}$ norm 1.

The usefulness of this observation is already evident in that we have the following (obvious)
estimate

(2.23)   $\|fg\|_{1/2,\infty} \lesssim \|f\|_{1,\infty} \|g\|_{1,\infty}.$

This inequality immediately generalizes to general products and indices. We use this
generalization below.
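One way to verify (2.23) is sketched below. It assumes the usual normalization $\|h\|_{r,\infty} = \sup_{\lambda > 0} \lambda\, |\{|h| > \lambda\}|^{1/r}$ of the weak quasi-norm, which the text does not fix explicitly, and the constant 4 is not optimized.

```latex
% Normalize \|f\|_{1,\infty} = \|g\|_{1,\infty} = 1 by homogeneity.
% If |f g| > \lambda, then |f| > \lambda^{1/2} or |g| > \lambda^{1/2}, so
\begin{align*}
|\{|fg|>\lambda\}|
  &\le |\{|f|>\lambda^{1/2}\}| + |\{|g|>\lambda^{1/2}\}|
   \le 2\lambda^{-1/2}, \\
\lambda\,|\{|fg|>\lambda\}|^{2} &\le 4 ,
\qquad\text{that is,}\qquad
\|fg\|_{1/2,\infty}\le 4\,\|f\|_{1,\infty}\,\|g\|_{1,\infty}.
\end{align*}
```

The same splitting, with thresholds $\lambda^{r/p_j}$ in place of $\lambda^{1/2}$, handles general products and indices.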

One can effectively use the symmetry in the formulation of the paraproducts in passing
to the sublinear form $\Lambda$, and considering weak type inequalities for it. Namely, for
$1 \le p_1, p_2 < \infty$, we define $p_3$ by $\frac{1}{p_1} + \frac{1}{p_2} + \frac{1}{p_3} = 1$. In particular, $p_3$ can be negative: For
$p_1 = p_2 = 1$, we have $\frac{1}{p_3} = -1$, which we interpret as the dual index to $\frac12$. Let $X(E)$ be the
space of functions supported on a measurable set $E \subset \mathbb{R}$ and bounded by 1. We then prove
the inequality

(2.24)   $|\Lambda(f_1, f_2, f_3)| \lesssim |E_3|^{1/p_3} \prod_{j=1}^{2} \|f_j\|_{p_j}, \qquad f_3 \in X(E_3).$

Observe that this implies (2.22). Also observe that the inequality for $\Lambda$ follows from the
following formulation: For all $E_3$, we can choose $E_3' \subset E_3$ with $|E_3'| \ge \frac12 |E_3|$, and

(2.25)   $|\Lambda(f_1, f_2, f_3)| \lesssim |E_3|^{1/p_3} \prod_{j=1}^{2} \|f_j\|_{p_j}, \qquad f_3 \in X(E_3').$

By dilation invariance, it suffices to prove this estimate in the case that $\|f_1\|_{p_1} = \|f_2\|_{p_2} = 1$
and $|E_3| = 1$. All of these comments apply equally well in the multiparameter case.

2.2. $H^1$ and BMO. We will restrict ourselves to the dyadic versions of the real Hardy space
$H^1$ and its dual BMO.

The Haar functions are

(2.26)   $h_I = |I|^{-1/2} (1_{I_-} - 1_{I_+}), \qquad I \in \mathcal{D},$

where $I_-$ ($I_+$) is the left (right) half of $I$. These functions form a basis for $L^2$. The dyadic
square function from Haar functions is formed as follows.

$S f := \Bigl[ \sum_{I \in \mathcal{D}} \frac{|\langle f, h_I \rangle|^2}{|I|} 1_I \Bigr]^{1/2}.$

We define the real dyadic Hardy space $H^1$ as those functions $f$ with

$\|f\|_{H^1} = \|f\|_1 + \|S f\|_1 < \infty.$

The dual to $H^1$ is BMO. This space has the equivalent norm

(2.27)   $\|f\|_{\mathrm{BMO}} = \sup_{J} \Bigl[ |J|^{-1} \sum_{I \subset J} |\langle f, h_I \rangle|^2 \Bigr]^{1/2}.$

These spaces are substitutes for $L^1$ and $L^\infty$. In the current setting sharp endpoint estimates
can be phrased in terms of these norms. And there is a rich interpolation theory between
these spaces.
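The Haar functions and the dyadic square function can be explored on a finite grid. The Python sketch below is an illustration under simplifying assumptions, not the paper's setting: it works on $[0,1)$ split into `N = 8` cells, uses only the seven dyadic intervals with at least two cells, and checks that $\|Sf\|_2 = \|f\|_2$ for a mean-zero step function. All names are hypothetical.

```python
import math

N = 8  # number of equal cells in the discretization of [0, 1)

def haar(start, length):
    """Haar function of the dyadic interval made of cells [start, start + length)."""
    size = length / N                       # |I|
    h = [0.0] * N
    for i in range(start, start + length // 2):
        h[i] = size ** -0.5                 # left half I_-
    for i in range(start + length // 2, start + length):
        h[i] = -(size ** -0.5)              # right half I_+
    return h

def inner(f, g):
    """L^2 inner product of step functions on the grid."""
    return sum(x * y for x, y in zip(f, g)) / N

# All dyadic intervals in [0, 1) with at least two cells.
dyadic = [(s, L) for L in (8, 4, 2) for s in range(0, N, L)]

def square_function(f):
    """S f = ( sum_I |<f, h_I>|^2 / |I| * 1_I )^(1/2) on the grid."""
    out = [0.0] * N
    for s, L in dyadic:
        c = inner(f, haar(s, L)) ** 2 / (L / N)
        for i in range(s, s + L):
            out[i] += c
    return [math.sqrt(v) for v in out]

f = [3.0, -1.0, 0.5, 2.0, -2.5, 0.0, 1.0, -3.0]   # mean zero on [0, 1)
Sf = square_function(f)
print(inner(f, f), inner(Sf, Sf))
```

The two printed numbers agree up to rounding, which is the discrete form of the $L^2$ identity behind (2.16) at $p = 2$ in the Haar case.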

3. Proofs in the One Parameter Case 

The case of Haar paraproducts is the only case that we consider in the one parameter case.
The rationale is that the proof in the multiparameter case includes the one parameter case as a
special instance. In addition, the Haar case is especially attractive, due to the presence of
the dyadic grid.

A particular way that it enters is this. Suppose that $\mathcal{I}$ is a collection of disjoint dyadic
intervals, not necessarily a partition of $\mathbb{R}$. We define the conditional expectation with respect
to $\mathcal{I}$ as

$\mathbb{E}(f \mid \mathcal{I})(x) := \begin{cases} |I|^{-1} \int_I f(y) \, dy, & x \in I, \ I \in \mathcal{I}, \\ f(x), & x \notin \bigcup_{I \in \mathcal{I}} I. \end{cases}$

We leave it as an exercise that these properties of the conditional expectation are true.

(1) Integrals are preserved under conditional expectation: $\int f \, dx = \int \mathbb{E}(f \mid \mathcal{I}) \, dx$.

(2) $f \mapsto \mathbb{E}(f \mid \mathcal{I})$ is a projection.

(3) $f \mapsto \mathbb{E}(f \mid \mathcal{I})$ is of norm one on all $L^p$, $1 \le p \le \infty$.

(4) $f \mapsto \mathbb{E}(f \mid \mathcal{I})$ is bounded as a map from dyadic $H^1$ into itself.
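Properties (1)-(3) are easy to observe on a grid. The following Python sketch is an illustration only: it takes step functions on eight equal cells of $[0,1)$, encodes each interval as a hypothetical `(start, length)` pair of cells, and checks the three properties for one example.

```python
def cond_exp(f, intervals):
    """E(f | I): average f over each interval of the disjoint collection,
    and leave f unchanged off the union of the intervals."""
    g = list(f)
    for s, L in intervals:
        avg = sum(f[s:s + L]) / L
        for i in range(s, s + L):
            g[i] = avg
    return g

def integral(f):
    """Integral over [0, 1) of a step function given by its cell values."""
    return sum(f) / len(f)

def lp_norm(f, p):
    return (sum(abs(x) ** p for x in f) / len(f)) ** (1 / p)

f = [3.0, -1.0, 0.5, 2.0, -2.5, 0.0, 1.0, -3.0]
collection = [(0, 2), (4, 4)]       # disjoint dyadic intervals, not a partition
g = cond_exp(f, collection)

print(g)
print(integral(f), integral(g))     # property (1)
print(cond_exp(g, collection) == g) # property (2)
print([lp_norm(g, p) <= lp_norm(f, p) for p in (1, 2, 4)])  # property (3)
```

Property (3) is Jensen's inequality applied on each interval of the collection.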



We first turn to the range of inequalities for the sublinear operator $L$ and the proof of
Theorem 2.11. Observe that by (2.19), we have

(3.28)   $L(f_1, f_2, f_3) \lesssim M f_1 \cdot S f_2 \cdot S f_3.$

Here, we assume that we have mean zero in the second and third places, and we continue
with this assumption below. To be specific, the sublinear operator is

$L(f_1, f_2, f_3) = \sum_{I \in \mathcal{D}} |I|^{-3/2} 1_I \langle f_1, |h_I| \rangle \, |\langle f_2, h_I \rangle| \, |\langle f_3, h_I \rangle|.$

For $f_1$, we form the inner product with the absolute value of the Haar function. The
inequalities in (2.12) then follow from Hölder's inequality, provided that we are not discussing an
endpoint estimate. When $s < 1$, one can instead apply an appropriate version of (2.23).

If any $p_j = 1$, then we only conclude that $M f_j$ and $S_j f_j$ are in $L^{1,\infty}$. But we can apply
(2.23) to conclude the weak type estimate. If any $p_j = 1$ and $f_j \in H^1$, then we conclude that
both $M f_j$ and $S_j f_j$ are in $L^1$, so that again Hölder's inequality or (2.23) will apply.

We concern ourselves with the endpoint estimates where either of $p_2, p_3$ is infinity and $L^\infty$
is replaced with BMO. One class of inequalities is in fact easily available; they are

(3.29)   $L : L^\infty \otimes \mathrm{BMO} \otimes H^1 \to L^1,$
         $L : L^\infty \otimes \mathrm{BMO} \otimes \mathrm{BMO} \to \mathrm{BMO}.$

Notice that these estimates can be interpolated by standard linear methods.

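Since the Haar functions are an unconditional basis for both $H^1$ and BMO,³ we can
conclude that

$\int L(f_1, f_2, f_3) \, dx \le \|f_1\|_\infty \sum_{I \in \mathcal{D}} |\langle f_2, h_I \rangle| \, |\langle f_3, h_I \rangle|$

$\lesssim \|f_1\|_\infty \|f_2\|_{\mathrm{BMO}} \|f_3\|_{H^1}.$

This proves the first bound.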



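³ While we are specifically appealing to the properties of the Haar functions here, this aspect does generalize
to the non-Haar functions.

For the BMO estimate, for each dyadic interval $J$ we have

$\int_J \sum_{I \subset J} |I|^{-3/2} 1_I \langle f_1, |h_I| \rangle \, |\langle f_2, h_I \rangle| \, |\langle f_3, h_I \rangle| \, dx \le \|f_1\|_\infty \prod_{j=2}^{3} \Bigl[ \sum_{I \subset J} |\langle f_j, h_I \rangle|^2 \Bigr]^{1/2}$

$\le |J| \, \|f_1\|_\infty \prod_{j=2}^{3} \|f_j\|_{\mathrm{BMO}}.$

This concludes the proof of the estimates (3.29).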

The last estimates to prove are these:

$L : L^{p_1} \otimes \mathrm{BMO} \otimes L^{p_3} \to L^s, \qquad \frac{1}{s} = \frac{1}{p_1} + \frac{1}{p_3}.$

At this point we make a more substantive reliance on the dyadic structure. The strategy is
first to prove the weak type inequalities, and in particular (2.24). Namely, we will choose
an exceptional set on which we will not attempt to estimate $L(f_1, f_2, f_3)$ and a conditional
expectation to apply to $f_1$, after which we will have a bounded function in the first coordinate.
But then we will be in a situation for which we can appeal to the estimates in which we have
duality.
We will prove that $L$ satisfies (2.24). Thus, fix functions $f_j$ in the appropriate spaces, of
norm one. Define

$E = \{M f_1 > 1\}.$

We do not attempt to estimate $L$ on this set. That has the practical implication that we
need only consider the sum

$L_E(f_1) := \sum_{I \not\subset E} |I|^{-3/2} 1_I \langle f_1, |h_I| \rangle \, |\langle f_2, h_I \rangle| \, |\langle f_3, h_I \rangle|.$

(Recall that we are assuming that $f_2$ and $f_3$ are fixed.) Let $\mathcal{I}$ be the collection of maximal
dyadic subintervals of $E$, and set $g_1 := \mathbb{E}(f_1 \mid \mathcal{I})$.

Note that by construction we will have the estimate $\|g_1\|_\infty \le 2$. For otherwise, there is an
$I \in \mathcal{I}$ on which $|g_1| > 2$; let $J$ be the smallest dyadic interval that strictly contains $I$
(i.e. the parent of $I$), and observe that $\int_J |g_1| \, dx > |J|$. That is, we contradict the maximality of $I$.

In addition, for each dyadic interval $I$ not contained in $E$, we have $\int_I f_1 \, dx = \int_I g_1 \, dx$.
Thus, we have

(3.30)   $L_E(f_1) = L_E(g_1).$
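The stopping-time construction above can be traced on a small example. The Python sketch below is a simplified model, not the paper's setting: functions are step functions on 16 cells of $[0,1)$, $M$ is taken to be the dyadic maximal function of $|f|$ restricted to subintervals of $[0,1)$, and intervals are hypothetical `(start, length)` pairs. It finds the maximal dyadic intervals of $E$, forms $g = \mathbb{E}(f \mid \mathcal{I})$, and checks that $|g| \le 2$ and that integrals over dyadic intervals not contained in $E$ are unchanged.

```python
N = 16  # cells of [0, 1); a dyadic interval is (start, length), length a power of 2

def average_abs(f, s, L):
    return sum(abs(x) for x in f[s:s + L]) / L

def dyadic_intervals():
    """Dyadic intervals of the grid, largest first."""
    L = N
    while L >= 1:
        for s in range(0, N, L):
            yield (s, L)
        L //= 2

def maximal_intervals(f, height=1.0):
    """Maximal dyadic intervals on which the average of |f| exceeds `height`.
    Their union is the set where the dyadic maximal function exceeds `height`."""
    chosen = []
    for s, L in dyadic_intervals():
        covered = any(t <= s and s + L <= t + M for t, M in chosen)
        if not covered and average_abs(f, s, L) > height:
            chosen.append((s, L))
    return chosen

def cond_exp(f, intervals):
    """Average f over each chosen interval; leave it unchanged elsewhere."""
    g = list(f)
    for s, L in intervals:
        avg = sum(f[s:s + L]) / L
        for i in range(s, s + L):
            g[i] = avg
    return g

f = [0.2, 0.2, 0.2, 0.2, 0.2, 6.0, 0.2, 0.2,
     0.2, 0.2, 0.2, 0.2, 1.5, -1.5, 0.2, 0.2]
E_intervals = maximal_intervals(f)
g = cond_exp(f, E_intervals)

print(E_intervals)
print(max(abs(x) for x in g))
```

In this example the spike of height 6 is replaced by its average 1.65 over the maximal interval that contains it, so $g$ is bounded by 2 although $f$ is not.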
Therefore, using (3.29), we can estimate

$|\{L(f_1, f_2, f_3) > 1\}| \le |E| + |\{L(g_1, f_2, f_3) > 1\}|$

$\lesssim 1 + \bigl[ \|g_1\|_\infty \|f_2\|_{\mathrm{BMO}} \|f_3\|_{p_3} \bigr]^{p_3}$

$\lesssim 1.$

Our discussion of the estimates in Theorem 2.11 is complete.

Let us turn to the bilinear operator $B(f_1, f_2)$ and the proof of Theorem 2.7. In the inequality
(2.8), if the index $r$ of the target space is between 1 and $\infty$, then we can appeal to duality,
as is done explicitly in (2.20).

We discuss the proof of the weak type bounds for $B$, in the case that duality does not
apply, namely $\frac12 < r < 1$. Marcinkiewicz interpolation will then deduce the strong type $L^r$
inequalities.

In so doing, we need only prove (2.22), and we will repeat the use of conditional expectation
in the argument (3.30) above. Take $f_j \in L^{p_j}$ of norm one, for $j = 1, 2$. Suppose that we have
in fact $\|f_j\|_\infty \le 1$. We conclude that in fact $\|f_j\|_q \le 1$ for all $p_j < q < \infty$, and so for $q > 4$
large, we can use the proven bound of $L^q \times L^q$ into $L^{q/2}$ to conclude that

$|\{B(f_1, f_2) > K\}| \lesssim 1.$

The general case can be reduced to this situation.

Define

$E = \bigcup_{j=1}^{2} \{M f_j > 1\}, \qquad F := \{M 1_E > \tfrac12\}.$

Clearly, the set $F$ has measure bounded by an absolute constant. We will not estimate
$B(f_1, f_2)$ on the set $F$. Define

$B_F(f_1, f_2) := \sum_{I \not\subset F} |I|^{-1/2} h_I \langle f_1, |h_I| \rangle \langle f_2, h_I \rangle.$

And set $\mathcal{I}$ to be the collection of maximal dyadic intervals contained in $F$. We set $g_j := \mathbb{E}(f_j \mid \mathcal{I})$.
Then, certainly we have $\|g_j\|_\infty \lesssim 1$. We claim that

(3.31)   $B_F(g_1, g_2) 1_{F^c} = B(f_1, f_2) 1_{F^c}.$

And this will complete our proof.

Suppose that $I$ is a dyadic interval that is not contained in $F$. The Haar function associated
to $I$ is constant on the two halves of $I$, which we denote as $I_\pm$. By our definition of $F$,
neither $I_\pm$ can be contained in $E$, hence we have

$\int_{I_\pm} f_j \, dx = \int_{I_\pm} g_j \, dx.$

This proves (3.31), and so we have completed the proof of the norm bounds for $B$.

We now consider paraproducts formed over sums of dyadic rectangles in $\mathbb{R}^d$. The class of
paraproducts is then invariant under a $d$ parameter family of dilations, a situation that we
refer to as "multi-parameter."⁴

Let us say that a function $\varphi$ is adapted to a rectangle $R = \bigotimes_{j=1}^{d} R_j$ iff $\varphi(x_1, \ldots, x_d) =
\prod_{j=1}^{d} \varphi_j(x_j)$, with each $\varphi_j$ adapted to the interval $R_j$ in the sense of (2.3).

Our paraproducts are of the same general form

$B(f_1, f_2, \ldots, f_n) = \sum_{R \in \mathcal{R}} \frac{\varphi_{n+1,R}}{|R|^{\frac{n-1}{2}}} \prod_{v=1}^{n} \langle f_v, \varphi_{v,R} \rangle.$

Here, we let $\mathcal{R} := \mathcal{D}^d$ be the class of dyadic rectangles. With the obvious changes, we will
also use the notations for the sublinear forms and operators given in (2.5) and (2.6).

The Theorem in this setting is

4.32. Theorem. Let $n \ge 2$ and $1 < p_v < \infty$ for $1 \le v \le n$, and define $\frac{1}{r} = \sum_{v=1}^{n} \frac{1}{p_v}$. Assume
that for each choice of coordinate $1 \le j \le d$, there are two choices of $1 \le v \le n + 1$ for which
we have

(4.33)   $\int_{\mathbb{R}} \varphi_{v,R}(x_1, x_2, \ldots, x_d) \, dx_j = 0$, for all $x_k$ with $k \ne j$ and all $R$.

Then, we have the inequality

(4.34)   $B : \bigotimes_{v=1}^{n} L^{p_v} \to L^r.$

Assume that the functions $\varphi_{v,R}$ satisfy (4.33) for all $j$. In the instance that $p_v = 1$, the
inequality remains true if we replace $L^1$ by $H^1$ defined below. In the instance that $p_v = \infty$,
we can replace $L^\infty$ by the larger space $\mathrm{BMO} = (H^1)^*$ defined below.



The critical distinction comes from the assumption about the zeros, (4.33). Let us say that
there are $x_j$ zeros in the $v$th position iff

(4.35)   $\int_{\mathbb{R}} \varphi_{v,R}(x_1, \ldots, x_d) \, dx_j = 0$ for all $x_k$ with $k \ne j$.

And so our assumption is that for each $1 \le j \le d$ there are two choices of $v$ for which we
have $x_j$ zeros in the $v$th position.

⁴ In this paper, the number of dimensions will be the number of parameters. In general, the two are however
distinct. Consider $\mathbb{R}^{d_1} \otimes \mathbb{R}^{d_2}$ and rectangles in this space formed from a cube in each space $\mathbb{R}^{d_j}$.

One would expect that in the Theorem above the space $L^\infty$ could be replaced by the larger
product BMO space. This is a more delicate consideration, one that we remark upon at the
end of the paper.

Again, the critical case is $n = 2$, so that $B$ is bilinear. There are essentially $d$ distinct cases.
The first case, with the greatest similarity to the one parameter case, is where we have, for
example, $x_j$ zeros in the first and second positions for all $1 \le j \le d$. The other cases do not have
a proper analog in the one parameter case.



4.1. $H^1$ and BMO. We turn to the product Hardy space theory, as developed by S.-Y. Chang
and R. Fefferman [1,2]. This section is not, strictly speaking, needed, but does
inform the modes of proof below.

$H^1(\mathbb{C}_+^d)$ will denote the $d$-fold product real valued Hardy space. This space consists of
functions $f : \mathbb{R}^d \to \mathbb{R}$ where $\mathbb{R}^d$ is viewed as the boundary of

$\mathbb{C}_+^d = \bigotimes_{j=1}^{d} \{z \in \mathbb{C} : \operatorname{Im}(z) > 0\}.$

We require that there is a function $F : \mathbb{C}_+^d \to \mathbb{C}$ that is holomorphic in each variable
separately and

$f(x) = \lim_{\|y\| \to 0} \operatorname{Re} F(x_1 + i y_1, \ldots, x_d + i y_d).$

The norm of $f$ is taken to be

$\|f\|_{H^1} = \lim_{y_1 \downarrow 0} \cdots \lim_{y_d \downarrow 0} \|F(x_1 + i y_1, \ldots, x_d + i y_d)\|_{L^1(\mathbb{R}^d)}.$

Product $H^1(\mathbb{C}_+^d)$ has the equivalent norm

(4.36)   $\|f\|_{H^1} \simeq \|f\|_1 + \|S f\|_1,$

where $S$ is the square function formed over the product Haar basis

$S f := \Bigl[ \sum_{R \in \mathcal{R}} \frac{|\langle f, h_R \rangle|^2}{|R|} 1_R \Bigr]^{1/2}.$

For a rectangle $R = \prod_{j=1}^{d} R_j \in \mathcal{D}^d$, we have set

$h_R(x_1, \ldots, x_d) = \prod_{j=1}^{d} h_{R_j}(x_j).$

The last product is over one dimensional Haar functions as in (2.26). The basis $\{h_R : R \in
\mathcal{D}^d\}$ is the $d$-fold tensor product of the Haar basis.

The dual of $H^1(\mathbb{C}_+^d)$ is $\mathrm{BMO}(\mathbb{C}_+^d)$, the $d$-fold product BMO space. It is a Theorem of
S.-Y. Chang and R. Fefferman [2] that this space has an explicit characterization in terms of
the product Haar basis. In particular, Chang and Fefferman showed that the product BMO
space has the equivalent norm

$\|b\|_{\mathrm{BMO}} = \sup_{U \subset \mathbb{R}^d} \Bigl[ |U|^{-1} \sum_{R \subset U} |\langle b, h_R \rangle|^2 \Bigr]^{1/2}$

where it is essential that the supremum be formed over all subsets $U \subset \mathbb{R}^d$ of finite measure.



4.2. The Governing Operators. We describe a range of operators, which encompass the
$d$ parameter maximal function at one end and the $d$ parameter square function at the other.
These operators, as we shall see, govern the behaviors of these paraproducts.

To be explicit, in the two parameter setting, these operators are as follows. First we have
the maximal function,

$MM f := \sup_{R \in \mathcal{R}} \frac{|\langle f, \varphi_R \rangle|}{\sqrt{|R|}} 1_R,$

which is a variant of the strong maximal function.

The reason for the iterated style notation becomes clearer with the second type of governing
operator. It is

$S_1 M_2 f := \Bigl[ \sum_{R_{(1)} \in \mathcal{D}} \sup_{R_{(2)} \in \mathcal{D}} \frac{|\langle f, \varphi_{R_{(1)} \times R_{(2)}} \rangle|^2}{|R|} 1_R \Bigr]^{1/2}, \qquad R = R_{(1)} \times R_{(2)}.$

In order for this to be a bounded operator, the functions $\{\varphi_R\}$ must have zeros in the first
coordinate. But then, the operator will be bounded on all $L^p$'s for $1 < p < \infty$. There is also
the operator $S_2 M_1$ in which the role of the coordinates is changed.

A third type of operator is

$M_1 S_2 f := \sup_{R_{(1)} \in \mathcal{D}} \Bigl[ \sum_{R_{(2)} \in \mathcal{D}} \frac{|\langle f, \varphi_R \rangle|^2}{|R|} 1_R \Bigr]^{1/2}, \qquad R = R_{(1)} \times R_{(2)}.$

The functions $\{\varphi_R\}$ must now have zeros in the second coordinate. And there is a
corresponding operator $M_2 S_1$ in which the roles of the coordinates are reversed.

A fourth type of operator is the familiar two parameter square function

$SS f := \Bigl[ \sum_{R \in \mathcal{R}} \frac{|\langle f, \varphi_R \rangle|^2}{|R|} 1_R \Bigr]^{1/2}.$

Here, we require that the functions have zeros in both coordinates. As with the maximal
function $MM$, the subscripts are not needed in this case.

In general, we set $T_j$ to be either the square function $S$ or the maximal function $M$, both
formed over a set of functions $\{\varphi_I : I \in \mathcal{D}\}$ acting on the $j$th coordinate. For a permutation
of the coordinates $\pi : \{1, \ldots, d\} \to \{1, \ldots, d\}$, set

(4.37)   $T := T_{\pi(1)} \cdots T_{\pi(d)}.$

The subscript $\pi(j)$ indicates in which coordinate the operator $T_{\pi(j)}$ operates. In each position
in which $T_{\pi(j)}$ is the square function, we require that the functions $\{\varphi_R : R \in \mathcal{R}\}$
have zeros in that coordinate. Note that these operators can be viewed as
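(4.38)   $T f(x) = \Bigl\| \cdots \Bigl\| \frac{\langle f, \varphi_R \rangle}{\sqrt{|R|}} 1_R(x) \Bigr\|_{\ell^{a(\pi(d))}(R_{(\pi(d))} \in \mathcal{D})} \cdots \Bigr\|_{\ell^{a(\pi(1))}(R_{(\pi(1))} \in \mathcal{D})}, \qquad R = R_{(1)} \times \cdots \times R_{(d)} \in \mathcal{R},$

where $a(j)$ is either 2 or $\infty$ for all $j$.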

The point of these definitions is that for all paraproducts $B$ of $d$ parameters, there are
three choices of $T_k$, $k = 1, 2, 3$, operators as in (4.37) for which we have

(4.39)   $\langle B(f_1, f_2), f_3 \rangle \lesssim \int \prod_{k=1}^{3} T_k(f_k) \, dx.$

This is a consequence of the essential hypothesis on there being two zeros in each coordinate.
In those two positions, one uses the square function. In every other position, the maximal
function is used.



$L^p$ bounds for the operators $T$. Let us discuss the mapping properties of these operators,
beginning with the maximal operator. We have been careful to insist that the functions $\varphi_R$
are products of adapted functions. Thus, in the two parameter case, we can appeal to the
one parameter maximal function twice, as follows.

$\|M_1 M_2 f\|_p \le \|M_1 \{M_2 f\}\|_p$

$\lesssim \|M_2 f\|_p$

$\lesssim \|f\|_p.$

Likewise, by a $d$ fold iteration of this argument, it follows that

$\|M \cdots M f\|_p \lesssim \|f\|_p, \qquad 1 < p < \infty.$
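The first inequality in the chain above, domination of the two parameter maximal function by an iteration of one parameter maximal functions, can be seen in a toy model. The Python sketch below replaces the adapted functions by plain averages over dyadic rectangles of a $4 \times 4$ grid; this is a simplifying assumption, and all names are illustrative.

```python
N = 4  # the grid is N x N; f[i][j] is the value on cell (i, j)

def dyadic_containing(i):
    """Dyadic intervals (start, length) of {0, ..., N-1} that contain cell i."""
    out, L = [], 1
    while L <= N:
        out.append((i - i % L, L))
        L *= 2
    return out

def max_1d(row):
    """One parameter dyadic maximal function of the values |row[i]|."""
    return [max(sum(abs(v) for v in row[s:s + L]) / L
                for s, L in dyadic_containing(i))
            for i in range(N)]

def M2(f):
    """Maximal function in the second coordinate (along each row)."""
    return [max_1d(row) for row in f]

def M1(f):
    """Maximal function in the first coordinate (along each column)."""
    cols = [max_1d([f[i][j] for i in range(N)]) for j in range(N)]
    return [[cols[j][i] for j in range(N)] for i in range(N)]

def strong_max(f):
    """Supremum of averages of |f| over dyadic rectangles containing the cell."""
    return [[max(sum(abs(f[a][b]) for a in range(s, s + L) for b in range(t, t + K)) / (L * K)
                 for s, L in dyadic_containing(i)
                 for t, K in dyadic_containing(j))
             for j in range(N)]
            for i in range(N)]

f = [[4, 0, 0, 0],
     [0, 0, 0, 0],
     [0, 0, 1, 0],
     [0, 0, 0, 0]]
strong = strong_max(f)
iterated = M1(M2(f))
dominated = all(strong[i][j] <= iterated[i][j] + 1e-12
                for i in range(N) for j in range(N))
print(dominated, strong[2][0], iterated[2][0])
```

The domination is pointwise, and it can be strict: at the cell `(2, 0)` the rectangle supremum is 1 while the iterated operator gives 1.0625.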



The same estimates are true for the square function, but are not as straightforward to
deduce.

4.40. Lemma. Assume that the functions $\{\varphi_R\}$ have zeros in every coordinate. Then we
have the inequalities below

(4.41)   $\|S \cdots S f\|_p \lesssim \|f\|_p, \qquad 1 < p < \infty,$

(4.42)   $\|S \cdots S f\|_1 \lesssim \|f\|_{H^1}.$

At the $L^\infty$ endpoint, the correct estimate is

(4.43)   $\sup_{U} |U|^{-1} \sum_{R \subset U} |\langle f, \varphi_R \rangle|^2 \lesssim \|f\|_\infty^2.$

The supremum is formed over all subsets $U \subset \mathbb{R}^d$ of finite measure.

Proof. We should consider the one parameter inequality

(4.44)   $\|S f\|_2 \lesssim \|f\|_2,$

as this will explain in part the assumptions used in the definition of adapted (2.3).

Consider first the inner product $\langle \varphi_I, \varphi_J \rangle$ for two dyadic intervals $|I| \le |J|$. Using the fact
that $\varphi_I$ has integral zero and that $\varphi_J$ admits a control on its first derivative, we estimate

$\rho(I, J) := |\langle \varphi_I, \varphi_J \rangle|$

$\le \int |\varphi_I(x) [\varphi_J(x) - \varphi_J(c(I))]| \, dx$

$\lesssim \Bigl( \frac{|I|}{|J|} \Bigr)^{3/2} \Bigl( 1 + \frac{|c(I) - c(J)|}{|J|} \Bigr)^{-N}.$

Here recall that $c(I)$ is the center of $I$. For this particular argument we only need $N = 3$,
say, though other parts of the argument require higher values.



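Now, observe that we have

$\rho_1 := \sup_{I} \sum_{J : |I| \le |J|} \rho(I, J) < \infty, \qquad \rho_2 := \sup_{J} \sum_{I : |I| \le |J|} \rho(I, J) < \infty.$

To prove (4.44), observe that by duality, it suffices to prove the estimate

$\Bigl\| \sum_{I} a_I \varphi_I \Bigr\|_2 \lesssim \Bigl[ \sum_{I} |a_I|^2 \Bigr]^{1/2}.$

Assume that the right hand side is one, and estimate the square of the left hand side as
follows, with a generous use of the Cauchy-Schwarz inequality.

$\Bigl\| \sum_{I} a_I \varphi_I \Bigr\|_2^2 \le 2 \sum_{|I| \le |J|} |a_I a_J| \rho(I, J)$

$\le 2 \Bigl[ \sum_{I} \Bigl| \sum_{J : |I| \le |J|} |a_J| \rho(I, J) \Bigr|^2 \Bigr]^{1/2}$

$\le 2 \Bigl[ \sum_{I} \Bigl\{ \sum_{J : |I| \le |J|} |a_J|^2 \rho(I, J) \Bigr\} \Bigl\{ \sum_{J : |I| \le |J|} \rho(I, J) \Bigr\} \Bigr]^{1/2}$

$\le 2 (\rho_1 \rho_2)^{1/2}.$

This completes the proof of (4.44).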
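The Cauchy-Schwarz computation above is an instance of Schur's test: a bilinear form with nonnegative kernel is controlled by the geometric mean of the largest row sum and the largest column sum. The Python sketch below checks this on random finite matrices. It is an illustration with hypothetical names, in which a generic matrix `rho` stands in for $\rho(I, J)$; the restriction $|I| \le |J|$ would correspond to setting the other entries to zero.

```python
import math
import random

def schur_bound(rho):
    """sqrt(max row sum * max column sum) of a nonnegative matrix."""
    row = max(sum(r) for r in rho)
    col = max(sum(c) for c in zip(*rho))
    return math.sqrt(row * col)

def bilinear_form(rho, a, b):
    """Sum over i, j of |a_i| |b_j| rho[i][j]."""
    return sum(abs(a[i]) * abs(b[j]) * rho[i][j]
               for i in range(len(a)) for j in range(len(b)))

def l2(a):
    return math.sqrt(sum(x * x for x in a))

random.seed(1)
checks = []
for _ in range(100):
    n = random.randint(1, 8)
    rho = [[random.random() for _ in range(n)] for _ in range(n)]
    a = [random.uniform(-1, 1) for _ in range(n)]
    b = [random.uniform(-1, 1) for _ in range(n)]
    checks.append(bilinear_form(rho, a, b) <= schur_bound(rho) * l2(a) * l2(b) + 1e-12)

print(all(checks))
```

In the proof, finiteness of $\rho_1$ and $\rho_2$ plays the role of the bounded row and column sums.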

The proof of (4.41) in the case of $p = 2$ follows from an iteration of the one parameter
result, just like the argument for the maximal function.

The $H^1$ to $L^1$ estimate is an easy consequence of the definition of the $H^1$ norm in (4.36).

The last estimate (4.43) is not as accessible, as it relies upon a "localization lemma,"
Lemma 4.48. Fix a set $U \subset \mathbb{R}^d$ of finite measure and a function $f$ bounded uniformly by one.

Set $T_0$ to be the $d$ parameter maximal function, and define a sequence of functions $f_k$ by
taking

$f_0 := f \, 1_{\{T_0 1_U > 2^{-1}\}}.$

And then inductively define $f_k$, $k \ge 1$ by

(4.45)   $f_0 + \cdots + f_k = f \, 1_{\{T_0 1_U > 2^{-1-k}\}}.$

Of course we have

$\sum_{R \subset U} |\langle f_0, \varphi_R \rangle|^2 \lesssim |U|$

by the $L^2$ boundedness of the square function. By Lemma 4.48, for integers $k \ge 1$ we have

(4.46)   $\sum_{R \subset U} |\langle f_k, \varphi_R \rangle|^2 \lesssim 2^{-k} |U|$

for an appropriate choice of integer $N$ in the definition of adapted, (2.3). As this last estimate
is summable in $k$, the proof is complete.

□ 



The case of general operators $T$ is treated in this proposition.

4.47. Proposition. All possible operators $T$ as in (4.37) map $L^p$ into itself for all $1 < p < \infty$.
This holds provided the functions $\{\varphi_R : R \in \mathcal{R}\}$ have zeros in each coordinate in which $T$
is equal to a square function. The norm depends only on the constants that enter into the
definition of adapted in (2.3).

Proof. In the two parameter case, observe that

$M_1 S_2 \le S_2 M_1,$

the inequality holding pointwise. More generally, for any operator $T$ as in (4.37), we can make
the operator larger by moving all maximal functions to the right of all square functions. Thus,
it suffices to bound operators of the form

$T = S_1 \cdots S_v M_{v+1} \cdots M_d.$
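The pointwise inequality $M_1 S_2 \le S_2 M_1$ reduces to an elementary fact about doubly indexed arrays: a supremum of $\ell^2$ norms is at most the $\ell^2$ norm of the suprema. The Python sketch below (illustrative names; the row index stands for $R_{(1)}$ and the column index for $R_{(2)}$) checks this on random arrays.

```python
import math
import random

def sup_of_l2(a):
    """max over rows i of ( sum_j a[i][j]^2 )^(1/2), the analog of M_1 S_2."""
    return max(math.sqrt(sum(x * x for x in row)) for row in a)

def l2_of_sup(a):
    """( sum_j ( max_i |a[i][j]| )^2 )^(1/2), the analog of S_2 M_1."""
    return math.sqrt(sum(max(abs(x) for x in col) ** 2 for col in zip(*a)))

random.seed(2)
ok = []
for _ in range(200):
    rows, cols = random.randint(1, 6), random.randint(1, 6)
    a = [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]
    ok.append(sup_of_l2(a) <= l2_of_sup(a) + 1e-12)

print(all(ok), sup_of_l2([[3, 0], [0, 4]]), l2_of_sup([[3, 0], [0, 4]]))
```

For the array `[[3, 0], [0, 4]]` the two quantities are 4 and 5, so moving the maximal function inside the square function can genuinely enlarge the operator.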



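Recall that the maximal function is bounded as a vector valued map from $L^p(\ell^2)$ into itself
for all $1 < p < \infty$. This is a well known result of C. Fefferman and Stein [6]. Namely, we
have the estimate

$\Bigl\| \Bigl[ \sum_{n} |M \cdots M g_n|^2 \Bigr]^{1/2} \Bigr\|_p \lesssim \Bigl\| \Bigl[ \sum_{n} |g_n|^2 \Bigr]^{1/2} \Bigr\|_p.$

For a function $f \in L^p(\mathbb{R}^d)$ and a dyadic rectangle $R_v \in \mathcal{D}^v$, set

$f_{R_v}(x_1, \ldots, x_v, x_{v+1}, \ldots, x_d) := \varphi_{R_v}(x_1, \ldots, x_v) \int f(y_1, \ldots, y_v, x_{v+1}, \ldots, x_d) \, \varphi_{R_v}(y_1, \ldots, y_v) \, dy_1 \cdots dy_v.$

It is a consequence of the one variable Littlewood-Paley inequalities that we have

$\Bigl\| \Bigl[ \sum_{R_v \in \mathcal{D}^v} |f_{R_v}|^2 \Bigr]^{1/2} \Bigr\|_p \lesssim \|f\|_p.$

To conclude, observe that we have

$T f \lesssim \Bigl[ \sum_{R_v \in \mathcal{D}^v} |M_{v+1} \cdots M_d f_{R_v}|^2 \Bigr]^{1/2}.$

The $L^p$ norm of the last quantity is clearly bounded by a constant times

$\Bigl\| \Bigl[ \sum_{R_v \in \mathcal{D}^v} |f_{R_v}|^2 \Bigr]^{1/2} \Bigr\|_p.$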

□ 



The proof of Theorem 4.32 for a particular range of indices can now be given. Suppose
that we are seeking to bound the paraproduct $B$ from $L^{p_1} \otimes L^{p_2}$ into $L^r$ where $1 < r < \infty$.
Then, by (4.39) and Proposition 4.47, we have

$\langle B(f_1, f_2), f_3 \rangle \lesssim \prod_{k=1}^{3} \|T_k f_k\|_{p_k} \lesssim \prod_{k=1}^{3} \|f_k\|_{p_k}$

where by abuse of notation we take $p_3$ to be the conjugate index to $r$.

The remainder of the theory that we develop is needed to address the case in which the
paraproduct does not obey a duality estimate.

4.3. A Localization Lemma. We will need an estimate which refines the $L^2$ estimates for
the operators $T$ proved in Proposition 4.47.

We will make a further definition in which these operators, defined as a mixture of sums
and supremums over rectangles, are restricted to a subset of rectangles. Thus, if $\mathcal{O} \subset \mathcal{R}$, and
$T = SSS$, for instance in the three parameter case, we set

$T_{\mathcal{O}} f := \Bigl[ \sum_{R \in \mathcal{O}} \frac{|\langle f, \varphi_R \rangle|^2}{|R|} 1_R \Bigr]^{1/2}.$

Of course here we insist that the functions $\varphi_R$ have zeros in each coordinate. More generally,
to define $T_{\mathcal{O}}$, in the expression (4.38), we restrict the rectangles to be in the collection $\mathcal{O}$
rather than all possible rectangles.

4.48. Lemma. Suppose that $\mathcal{O} \subset \mathcal{R}$ and that there is a constant $\mu > 1$ so that for a function
$f$,

(4.49)   $\operatorname{supp}(f) \cap \mu R = \emptyset, \qquad R \in \mathcal{O}.$

Then it is the case that we have

(4.50)   $\|T_{\mathcal{O}} f\|_2 \lesssim \mu^{-N'} \|f\|_2.$

The exponent $N'$ is a function of only the exponent $N$ in the definition of adapted, (2.3).

Proof. This lemma is a corollary to the proof of 1? boundedness of the operators Tj, and to 
deduce it, we will rely upon a degree of flexibility built into the definition of adapted. 



E 

2 

Lemma would be obvious. Thus, the problem at hand is one of Schwartz tails 



If we knew that the functions tp^ were supported on, e.g., ^R, then the conclusion of the 



We make a further specification of the definition of adapted, (2.3), which is applied to functions on the real line. Fix a constant $K$ and an integer $N$. Say that $\varphi$ is $(K, N)$-adapted to an interval $I$ iff

$$|D^n \varphi(x)| \le K |I|^{-n - \frac12} \Bigl( 1 + \frac{|x - c(I)|}{|I|} \Bigr)^{-N}, \qquad n = 0, 1.$$

In addition, the $L^2$ norm of $\varphi$ is at most one.

Say that $\varphi$ is $(K, N)$-adapted to a rectangle $R = \otimes_{j=1}^{d} R_{(j)} \subset \mathbb R^d$ iff $\varphi$ is a product

$$\varphi(x_1, \dots, x_d) = \prod_{j=1}^{d} \varphi_j(x_j)$$

with each $\varphi_j$ $(K, N)$-adapted to $R_{(j)}$. This definition naturally extends to collections of rectangles.

Now, fix $K_0$ and $N_0$ so that Proposition 4.47 holds for all functions $\{\varphi_R : R \in \mathcal R\}$ which are $(K_0, N_0)$-adapted to $\mathcal R$ (and have zeros in the right coordinates). Here, we take $K_0 > 1$ so that there will be no difficulties with having the functions $\varphi_R$ be of $L^2$ norm one.

For $N'$ as in the conclusion of the Lemma, set $N_1 = N_0 + N'$. Consider functions $\{\varphi_R : R \in \mathcal R\}$ that are $(K_0, N_1)$-adapted to $\mathcal R$. And let $T$ be the corresponding operator constructed from these functions.

Observe that we can define a new set of functions $\{\tilde\varphi_R : R \in \mathcal R\}$ that are $(2K_0, N_0)$-adapted to $\mathcal R$, and satisfy

$$\tilde\varphi_R(x) = \mu^{N'} \varphi_R(x), \qquad x \notin \mu R.$$

In those coordinates $1 \le j \le d$ where there is no zero, this is accomplished by multiplying by a smooth function that is zero on a large neighborhood of $R$, and identically $\mu^{N'/d}$ in $\mathbb R - \mu R_{(j)}$. If the coordinate has a zero, observe that

$$\Bigl| \int_{\mathbb R - \mu R_{(j)}} \varphi_{R_{(j)}}(x_j) \, dx_j \Bigr| \le 2 K_0 |R_{(j)}|^{1/2} \mu^{-N_1 + 1}$$

provided $N_1 > 2$. And so we can set $\tilde\varphi_{R_{(j)}}$ in a neighborhood of $R$ to cancel out this integral.

The operator $\tilde T$ constructed from the functions $\{\tilde\varphi_R : R \in \mathcal R\}$ will satisfy an $L^2$ bound that is independent of $\mu$. For a function $f$ as in (4.49), we have

$$\mu^{N'} T_{\mathcal O} f = \tilde T_{\mathcal O} f.$$

And the right hand side admits an $L^2$ bound independent of $\mu$ and $N'$, so the proof is complete.

□ 
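The tail bound used for coordinates with a zero can be checked directly from the decay condition. The following is only a sketch under assumptions not fixed in the text above: $\mu I$ is taken to be the interval with the same center $c(I)$ as $I$ and length $\mu |I|$, and the final constant is allowed to depend on $N_1$.

```latex
% Sketch: tail integral of a function \varphi that is (K_0, N_1)-adapted to an interval I.
% Assumption: \mu I has center c(I) and length \mu |I|, so that
% \mathbb{R} - \mu I = \{ x : |x - c(I)| > \mu |I| / 2 \}.
\begin{align*}
\int_{\mathbb{R} - \mu I} |\varphi(x)| \, dx
  &\le K_0 |I|^{-1/2} \int_{|x - c(I)| > \mu |I| / 2}
        \Bigl( 1 + \frac{|x - c(I)|}{|I|} \Bigr)^{-N_1} dx \\
  &= 2 K_0 |I|^{1/2} \int_{\mu/2}^{\infty} (1 + t)^{-N_1} \, dt
     && (t = |x - c(I)| / |I|) \\
  &= \frac{2 K_0}{N_1 - 1} \, |I|^{1/2} \Bigl( 1 + \frac{\mu}{2} \Bigr)^{-N_1 + 1}
   \le \frac{2^{N_1} K_0}{N_1 - 1} \, |I|^{1/2} \mu^{-N_1 + 1}.
\end{align*}
```

Since $N_1 = N_0 + N'$, multiplying this tail by $\mu^{N'}$ still leaves a factor $\mu^{-N_0 + 1}$, which is consistent with the modified functions remaining adapted with exponent $N_0$.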

We will also need the following corollary to the previous Lemma. 

4.51. Corollary. Let $\mathcal O$ be a collection of rectangles whose shadow has finite measure. If $f$ is a bounded function, we have the estimate

$$\|T_{\mathcal O} f\|_2 \lesssim |\operatorname{sh}(\mathcal O)|^{1/2} \|f\|_\infty. \tag{4.52}$$

Proof. Let $f \in L^\infty$ be bounded by one, set $U = \operatorname{sh}(\mathcal O)$, and define $f_k$, for $k \ge 0$, as in (4.45). We shall see that

$$\sum_{k} \|T_{\mathcal O} f_k\|_2 \lesssim |\operatorname{sh}(\mathcal O)|^{1/2}. \tag{4.53}$$

Indeed, applying Lemma 4.48, we see that

$$\|T_{\mathcal O} f_k\|_2 \lesssim 2^{-N' k} |\operatorname{sh}(\mathcal O)|^{1/2},$$

where we can assume that $N' > 4$ say. □



4.4. The Proof of Theorem 4.32. We only treat the bilinear case of the theorem, as the higher order linearities are easy to accommodate into this proof. We also restrict our attention to the two parameter setting. Straightforward modifications adapt the argument to an arbitrary number of parameters. The first cases that we consider are those in which $B$ is to be mapped into a space $L^r$ with $\tfrac12 < r < 1$.

Some of the generalities of the proof of the one dimensional case remain in force in the current setting; in particular, it will suffice for us to establish (2.25). That is, we shall demonstrate this: For all $f_j \in L^{p_j}$ of norm 1 and set $E_3 \subset \mathbb R^2$ of measure one, there is an open subset $E_3' \subset E_3$ of measure at least $\tfrac12$, so that for $f_3$ a smooth function compactly supported in $E_3'$ and with $L^\infty$ norm at most one, we have

$$\sum_{R} |R| \prod_{j=1}^{3} \frac{|\langle f_j, \varphi_{j,R} \rangle|}{\sqrt{|R|}} \lesssim 1. \tag{4.54}$$

Moreover, it suffices to take $f_j$ in a dense class of functions, and so we take $f_1$ and $f_2$ to be smooth and compactly supported.

Observe that as all $f_j$ are smooth and compactly supported, the sum above is at most 1 if the sum is restricted to rectangles that have at least one side length either small or large, as in these cases the inner products above decay rapidly.⁵ Thus, we can assume that the sum is restricted to a finite number of rectangles, and we should provide an estimate for the sum that is independent of the exact number or nature of the rectangles.

This last sum is over positive summands. It will be useful to us to organize the sum over appropriate subcollections of $\mathcal R$. For a collection of dyadic rectangles $\mathcal O$ set

$$\operatorname{Sum}(\mathcal O) \overset{\text{def}}{=} \sum_{R \in \mathcal O} |R| \prod_{j=1}^{3} \frac{|\langle f_j, \varphi_{j,R} \rangle|}{\sqrt{|R|}} \tag{4.55}$$

where the functions $\varphi_{j,R}$ are associated with the bilinear paraproduct that we are considering.

We will be working with different collections of rectangles $\mathcal O$. The shadow of $\mathcal O$ is defined to be

$$\operatorname{sh}(\mathcal O) \overset{\text{def}}{=} \bigcup_{R \in \mathcal O} R.$$



⁵ With the precise definition of small and large depending upon the functions $f_j$.

The Definition of $E_3'$. Let $T_j$, for $j = 1, 2, 3$, be the three operators as in (4.39). (Though at this point the function $f_3$ is not yet specified.) For the sake of symmetry, set $T_0$ to be the maximal operator in 2 parameters.

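Define $4\nu \overset{\text{def}}{=} \min(p_1, p_2)$, and set

$$\Omega_{j,\ell} = \{T_j f_j > \kappa 2^{\ell}\}, \qquad \ell \in \mathbb Z, \ j = 1, 2, \tag{4.56}$$

$$\Omega_{\ell} = \bigcup_{j=1}^{2} \Omega_{j,\ell}, \tag{4.57}$$

$$\tilde\Omega \overset{\text{def}}{=} \bigcup_{\ell \in \mathbb N} \{T_0 \mathbf 1_{\Omega_\ell} > 2^{-\nu\ell}\}, \tag{4.58}$$

$$\Omega \overset{\text{def}}{=} \{T_0 \mathbf 1_{\tilde\Omega} > \tfrac12\}. \tag{4.59}$$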

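In these definitions, we fix a value of $\kappa \simeq 1$ so that $|\Omega| < \tfrac12$, and then take $E_3' = E_3 \cap \Omega^c$, so that the measure of this set is at least $\tfrac12$. This is possible, since we can estimate, using the $L^2$ bound for the maximal function and the $L^{p_j}$ bounds for the $T_j$,

$$|\Omega| \lesssim \sum_{\ell \in \mathbb N} |\{T_0 \mathbf 1_{\Omega_\ell} > 2^{-\nu\ell}\}| \lesssim \sum_{\ell \in \mathbb N} 2^{2\nu\ell} |\Omega_\ell| \lesssim \sum_{j=1}^{2} \sum_{\ell \in \mathbb N} \kappa^{-p_j} 2^{(2\nu - p_j)\ell}.$$

And the last sum is less than $\tfrac12$ for a fixed $\kappa \simeq 1$.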
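To see why the last sum can be made small, here is a sketch of the geometric series computation. It assumes, as the surrounding text suggests, that $4\nu = \min(p_1, p_2)$ and that the sum runs over integers $\ell \ge 0$.

```latex
% Assumes 4\nu = \min(p_1, p_2), so that 2\nu - p_j \le -p_j / 2 < 0 for j = 1, 2,
% and that the sum is over integers \ell \ge 0.
\begin{align*}
\sum_{j=1}^{2} \sum_{\ell \ge 0} \kappa^{-p_j} 2^{(2\nu - p_j)\ell}
  \le \sum_{j=1}^{2} \kappa^{-p_j} \sum_{\ell \ge 0} 2^{-p_j \ell / 2}
  = \sum_{j=1}^{2} \frac{\kappa^{-p_j}}{1 - 2^{-p_j/2}} .
\end{align*}
```

The right hand side tends to zero as $\kappa$ grows, so a fixed $\kappa$, depending only on $p_1$, $p_2$ and the implied constants, suffices.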



The Decomposition of $\mathcal R$. We decompose the collection of all rectangles. A rectangle $R$ is in $\mathcal O_{j,\ell}$ iff $\ell$ is the greatest integer such that

$$|R \cap \Omega_{j,\ell}| = |R \cap \{T_j f_j > \kappa 2^{\ell}\}| \ge \tfrac{1}{100} |R|. \tag{4.60}$$

Here, we are extending the definition in (4.56) to $j = 1, 2, 3$ and to $\ell \in \mathbb Z$.

Observe that as each $f_j$ is smooth, it is necessarily the case that $T_j f_j$ is a bounded function. Thus, the definition above makes sense, and for each $j$, every rectangle $R \in \mathcal R$ will be a member of some $\mathcal O_{j,\ell}$, for $\ell \in \mathbb Z$.

For integers $\vec\ell = (\ell_1, \ell_2, \ell_3) \in \mathcal L \overset{\text{def}}{=} (-\mathbb N) \times (-\mathbb N) \times \mathbb Z$, we define

$$\mathcal O_{\vec\ell} \overset{\text{def}}{=} \bigcap_{j=1}^{3} \mathcal O_{j,\ell_j}. \tag{4.61}$$

This is not a complete decomposition of the collection of rectangles, a point we return to below.

We appeal to the principal technical estimate, proved below. Observe that, by Lemma 4.70 and using the notation of (4.55), it is the case that

$$\operatorname{Sum}(\mathcal O_{\vec\ell}) \lesssim 2^{\ell_1 + \ell_2 + \ell_3} |\operatorname{sh}(\mathcal O_{\vec\ell})|. \tag{4.62}$$
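For orientation, here is a sketch of how (4.62) follows from Lemma 4.70. It assumes the reading of (4.60) in which $\ell_j$ is the greatest integer for which the level set $\{T_j f_j > \kappa 2^{\ell_j}\}$ fills the stated proportion of $R$, so that the hypothesis (4.71) holds with thresholds $\lambda_j = \kappa 2^{\ell_j + 1}$ (a choice made here for illustration).

```latex
% Assumption: R \in \mathcal{O}_{\vec\ell}, and by maximality of \ell_j in (4.60)
% the level set at height \kappa 2^{\ell_j + 1} occupies only a small proportion of R.
% Then (4.71) holds with \lambda_j = \kappa 2^{\ell_j + 1}, and (4.72) gives
\begin{align*}
\operatorname{Sum}(\mathcal{O}_{\vec\ell})
  \lesssim |\operatorname{sh}(\mathcal{O}_{\vec\ell})| \prod_{j=1}^{3} \kappa 2^{\ell_j + 1}
  = 8 \kappa^{3} \, 2^{\ell_1 + \ell_2 + \ell_3} \, |\operatorname{sh}(\mathcal{O}_{\vec\ell})| .
\end{align*}
```

Since $\kappa \simeq 1$ is fixed, the factor $8\kappa^3$ is absorbed into the implied constant.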

We therefore need effective estimates for the shadow. Since each $T_j$ operator is bounded on all $L^p$ spaces, we of course have the estimate

$$|\operatorname{sh}(\mathcal O_{\vec\ell})| \lesssim \min_{j} 2^{-p_j \ell_j} \le \prod_{j=1}^{3} 2^{-\theta_j p_j \ell_j}. \tag{4.63}$$

Here, $\ell_1, \ell_2 \le 0$ while $\ell_3 \in \mathbb Z$, and $\theta_j \ge 0$ with $\theta_1 + \theta_2 + \theta_3 = 1$. Recall that $p_1$ and $p_2$ are specified to us in advance, but as $f_3$ is a bounded function on a set of finite measure, we are free to take any value of $1 < p_3 < \infty$ that we wish. In particular, it is effective to take $p_3$ to be relatively close to one for $\ell_3 \le 0$ while we take $p_3$ large for $\ell_3 > 0$.

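The sums are treated separately based on the sign of the last coordinate of $\vec\ell \in \mathcal L$. Combining (4.62) and (4.63), we see that

$$\sum_{\vec\ell \in (-\mathbb N)^3} \operatorname{Sum}(\mathcal O_{\vec\ell}) \lesssim \sum_{\vec\ell \in (-\mathbb N)^3} 2^{\ell_1 (1 - p_1 \theta_1) + \ell_2 (1 - p_2 \theta_2) + \ell_3 (1 - p_3 \theta_3)}. \tag{4.64}$$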

We should choose $0 < \theta_1 < \tfrac{1}{p_1}$ and $0 < \theta_2 < \tfrac{1}{p_2}$ so that $\theta_1 + \theta_2 < 1$. We are then still free to choose $p_3 > 1$, but close enough to one so that $p_3 \theta_3 = p_3 (1 - \theta_1 - \theta_2) < 1$. Thus, this last sum is no more than a constant.
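The two elementary facts behind this step can be sketched as follows. The first display is the weighted geometric mean bound that lets a minimum be replaced by a product with weights $\theta_j$; the second sums the resulting series, writing $\epsilon_j = 1 - p_j \theta_j$ (a shorthand introduced here only for the illustration) and assuming the sum runs over integers $\ell_j \le 0$.

```latex
% (i) For a_j > 0 and weights \theta_j \ge 0 with \theta_1 + \theta_2 + \theta_3 = 1:
\[
  \min_{j} a_j \;=\; \prod_{j=1}^{3} \bigl( \min_{i} a_i \bigr)^{\theta_j}
  \;\le\; \prod_{j=1}^{3} a_j^{\theta_j} .
\]
% (ii) With \epsilon_j = 1 - p_j \theta_j > 0 (hypothetical shorthand) and \ell_j \le 0:
\[
  \sum_{\ell_1, \ell_2, \ell_3 \le 0}
    2^{\ell_1 \epsilon_1 + \ell_2 \epsilon_2 + \ell_3 \epsilon_3}
  \;=\; \prod_{j=1}^{3} \sum_{m \ge 0} 2^{-m \epsilon_j}
  \;=\; \prod_{j=1}^{3} \frac{1}{1 - 2^{-\epsilon_j}} \;<\; \infty .
\]
```

The constraints $\theta_j < 1/p_j$ for $j = 1, 2$ and $p_3 \theta_3 < 1$ are exactly what make each $\epsilon_j$ positive.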

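Let us consider the case of $\ell_1, \ell_2 \le 0$ while $\ell_3 > 0$. The minimum in (4.63) occurs for $j = 3$, and we have the estimate

$$\sum_{\vec\ell} \operatorname{Sum}(\mathcal O_{\vec\ell}) \lesssim \sum_{\vec\ell} 2^{\ell_1 + \ell_2 - \ell_3 (p_3 - 1)}, \tag{4.65}$$

with both sums taken over $\ell_1, \ell_2 \le 0 < \ell_3$. This clearly sums to a constant for $p_3$ sufficiently large.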



Some rectangles are not in the classes defined above. To treat the remaining cases, set

$$\mathcal D_{\vec\ell} \overset{\text{def}}{=} \mathcal O_{1,\ell_1} \cap \mathcal O_{2,\ell_2}, \qquad \vec\ell = (\ell_1, \ell_2) \in \mathcal P \overset{\text{def}}{=} \mathbb Z^2 - (-\mathbb N) \times (-\mathbb N). \tag{4.66}$$

This decomposition does not take the role of $f_3$ into account, and so our next steps are to deduce information about this function.

Suppose that $R \in \mathcal D_{\vec\ell}$. Then either $\ell_1$ or $\ell_2$ must be positive. Suppose that $\ell_1$ is. By (4.58), it is then the case that, by definition, $R \subset \Omega$, but moreover

$$2^{\nu' \ell_1} R \cap E_3' = \emptyset,$$

where $2\nu' = \nu$.

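Now, the function $f_3$ satisfies the conditions of (4.49), with $\mathcal O = \mathcal D_{\vec\ell}$ and $\mu = 2^{\nu' \ell_1 / 2}$. Then by Lemma 4.48, we have

$$\|T_{3,\mathcal D_{\vec\ell}} f_3\|_2 \lesssim 2^{-N' \nu' \ell_1 / 2} \lesssim 2^{-10 \ell_1} \tag{4.67}$$

for appropriate choice of $N$ in (2.3).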

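Note that we have proved the inequality

$$\|T_{3,\mathcal D_{\vec\ell}} f_3\|_2 \lesssim \min(2^{-10 \ell_1}, 2^{-10 \ell_2}).$$

We can then apply (4.73) to see that

$$\operatorname{Sum}(\mathcal D_{\vec\ell}) \lesssim 2^{\ell_1 + \ell_2} \min(2^{-10 \ell_1}, 2^{-10 \ell_2}).$$

This is clearly summable to a constant over the indices $\vec\ell \in \mathcal P$, as either $\ell_1$ or $\ell_2$ must be positive.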
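The summability can be made explicit. The following sketch assumes that the index set consists of integer pairs $(\ell_1, \ell_2)$ with $m = \max(\ell_1, \ell_2) \ge 1$ (the symbol $m$ is introduced here for illustration), and uses the identity $\min(2^{-10\ell_1}, 2^{-10\ell_2}) = 2^{-10m}$.

```latex
% For fixed m \ge 1, the pairs with \max(\ell_1, \ell_2) = m have one coordinate equal to m
% and the other, call it \ell', at most m. Counting both orders gives the factor 2.
\begin{align*}
\sum_{\max(\ell_1, \ell_2) = m} 2^{\ell_1 + \ell_2} \, 2^{-10 m}
  &\le 2 \sum_{\ell' \le m} 2^{m + \ell'} \, 2^{-10 m}
   = 2 \cdot 2^{-9m} \cdot 2^{m + 1}
   = 4 \cdot 2^{-8m}, \\
\sum_{m \ge 1} 4 \cdot 2^{-8m}
  &= \frac{4 \cdot 2^{-8}}{1 - 2^{-8}} = \frac{4}{255} .
\end{align*}
```

The exponent 10 is far more than is needed here; any decay rate strictly larger than 2 in $m$ would give a convergent series by the same computation.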

This proof will permit e.g. $p_1 = 1$, with the additional hypothesis that $f_1 \in H^1$, and that all functions $\varphi_{1,R}$ satisfy (4.33) for all coordinates $1 \le j \le d$. By duality, this implies the BMO estimate of our Theorem.

The Endpoint Estimates. The endpoint estimates concern the case when, say, $p_2 = \infty$, which is a case not handled in the discussion above. (Note that assuming that $f_2 \in L^\infty$, we do not need to make additional assumptions about the zeros of the functions $\varphi_{2,R}$.)

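We again prove (4.54). And the method of proof is quite close to the argument above. Use the same notation as in (4.56), but now define

$$\tilde\Omega \overset{\text{def}}{=} \bigcup_{\ell \in \mathbb N} \{T_0 \mathbf 1_{\Omega_{1,\ell}} > \tfrac{1}{100} 2^{-\nu\ell}\}, \tag{4.68}$$

$$\Omega = \{T_0 \mathbf 1_{\tilde\Omega} > \tfrac12\}. \tag{4.69}$$

We take $E_3' = E_3 \cap \Omega^c$, so that again we have $|E_3'| \ge \tfrac12$.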

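We define the sets $\mathcal O_{j,\ell}$ as in (4.60) but we shall only use this for $j = 1, 3$. Set (in contrast to (4.61)),

$$\mathcal O_{\vec\ell} = \mathcal O_{1,\ell_1} \cap \mathcal O_{3,\ell_3}, \qquad \vec\ell = (\ell_1, \ell_3).$$

We then have the estimate below, as a consequence of Corollary 4.51 and (4.73),

$$\operatorname{Sum}(\mathcal O_{\vec\ell}) \lesssim 2^{\ell_1 + \ell_3} |\operatorname{sh}(\mathcal O_{\vec\ell})|^{1/2} \|T_{2,\mathcal O_{\vec\ell}} f_2\|_2 \lesssim 2^{\ell_1 + \ell_3} |\operatorname{sh}(\mathcal O_{\vec\ell})|.$$

We estimate the shadow.

$$|\operatorname{sh}(\mathcal O_{\vec\ell})| \lesssim \min(2^{-p_1 \ell_1}, 2^{-p_3 \ell_3}).$$

This follows on the one hand from the assumption that $f_1 \in L^{p_1}$. But recall that we can choose $1 < p_3 < \infty$ in an arbitrary fashion, as $f_3$ is bounded by one and supported on a set of measure at most one.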

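Pulling these estimates together, we see that

$$\operatorname{Sum}(\mathcal O_{\vec\ell}) \lesssim 2^{\ell_1 (1 - \theta_1 p_1) + \ell_3 (1 - \theta_3 p_3)},$$

where the $\theta_1, \theta_3$ are non-negative and sum to one. The index $p_1$ is specified to us, but $p_3$ can be taken arbitrarily. For $\vec\ell \in (-\mathbb N) \times \mathbb Z$, we should take $p_3$ close to one for $\ell_3 < 0$, but $p_3 = 4$, say, for $\ell_3 > 0$. Doing so we see that

$$\sum_{\vec\ell \in (-\mathbb N) \times \mathbb Z} \operatorname{Sum}(\mathcal O_{\vec\ell}) \lesssim 1.$$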



We turn to the case where $\ell_1 > 0$. As before, we should now gain additional information about the function $f_3$. But the reasoning of the previous section, and in particular (4.50) and (4.67), leads us immediately to

$$\|T_{3,\mathcal O_{1,\ell_1}} f_3\|_2 \lesssim 2^{-10 p_1 \ell_1}.$$

On the other hand, Corollary 4.51 implies that

$$\|T_{2,\mathcal O_{1,\ell_1}} f_2\|_2 \lesssim |\operatorname{sh}(\mathcal O_{1,\ell_1})|^{1/2} \lesssim 2^{-p_1 \ell_1 / 2}.$$

Appealing to (4.74), we see that

$$\operatorname{Sum}(\mathcal O_{1,\ell_1}) \lesssim 2^{\ell_1 - 10 p_1 \ell_1 - p_1 \ell_1 / 2},$$

which is clearly summable over $\ell_1 > 0$.



The Principal Technical Estimate. In this section we isolate the principal technical estimate
in the proof of Theorem 4.32.

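4.70. Lemma. Suppose that for three constants $0 < \lambda_j < \infty$, $j = 1, 2, 3$, and a collection of rectangles $\mathcal O$ we have

$$|R \cap \{T_j f_j > \lambda_j\}| \le \tfrac{1}{100} |R|, \qquad R \in \mathcal O, \ j = 1, 2, 3. \tag{4.71}$$

Then we have the estimate

$$\operatorname{Sum}(\mathcal O) \lesssim |\operatorname{sh}(\mathcal O)| \prod_{j=1}^{3} \lambda_j. \tag{4.72}$$

Suppose that (4.71) does not hold for $j = 3$. Then we have the estimate

$$\operatorname{Sum}(\mathcal O) \lesssim \lambda_1 \lambda_2 |\operatorname{sh}(\mathcal O)|^{1/2} \|T_{3,\mathcal O} f_3\|_2. \tag{4.73}$$

Suppose that (4.71) does not hold for $j = 2$ and $j = 3$. Then we have the estimate

$$\operatorname{Sum}(\mathcal O) \lesssim \lambda_1 \|T_{2,\mathcal O} f_2\|_2 \|T_{3,\mathcal O} f_3\|_2. \tag{4.74}$$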



We will apply this in settings in which we have a good estimate for the shadow of O in 
terms of the Xj. 

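Proof. Set

$$W = \operatorname{sh}(\mathcal O) \cap \bigcap_{j=1}^{3} \{T_j f_j \le \lambda_j\}.$$

Then $R \cap W$ has measure at least $\tfrac12 |R|$ for each $R \in \mathcal O$. This permits us to restrict the range of integration below to $W$.

$$\operatorname{Sum}(\mathcal O) \lesssim \int_W \sum_{R \in \mathcal O} \prod_{j=1}^{3} \frac{|\langle f_j, \varphi_{j,R} \rangle|}{\sqrt{|R|}} \mathbf 1_R(x) \, dx \le \int_W \prod_{j=1}^{3} T_j f_j \, dx \lesssim |\operatorname{sh}(\mathcal O)| \prod_{j=1}^{3} \lambda_j.$$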

This proves our first conclusion. The remaining conclusions follow from the same reasoning, 
with the use of the Cauchy-Schwarz inequality. □ 
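As an illustration of the remaining cases, here is a sketch of how (4.73) might be obtained. It assumes that, when (4.71) is only available for $j = 1, 2$, one works on the set $W' = \operatorname{sh}(\mathcal O) \cap \{T_1 f_1 \le \lambda_1\} \cap \{T_2 f_2 \le \lambda_2\}$ (notation introduced here), and that the pointwise product bound used in the proof holds with $T_3$ replaced by its restriction $T_{3,\mathcal O}$.

```latex
% Assumption: W' = sh(O) \cap \{T_1 f_1 \le \lambda_1\} \cap \{T_2 f_2 \le \lambda_2\},
% and T_{3,\mathcal{O}} f_3 \ge 0, so the domain of integration may be enlarged to sh(O).
\begin{align*}
\operatorname{Sum}(\mathcal{O})
  &\lesssim \int_{W'} T_1 f_1 \cdot T_2 f_2 \cdot T_{3,\mathcal{O}} f_3 \, dx
   \le \lambda_1 \lambda_2 \int_{\operatorname{sh}(\mathcal{O})} T_{3,\mathcal{O}} f_3 \, dx \\
  &\le \lambda_1 \lambda_2 \, |\operatorname{sh}(\mathcal{O})|^{1/2} \,
       \| T_{3,\mathcal{O}} f_3 \|_2
   && \text{(Cauchy-Schwarz on } \operatorname{sh}(\mathcal{O})\text{)} .
\end{align*}
```

The estimate (4.74) would follow in the same way, with the Cauchy-Schwarz inequality applied to the product $T_{2,\mathcal O} f_2 \cdot T_{3,\mathcal O} f_3$ instead.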
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